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Applicant's or agent's file reference 
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Box No. I TITLE OF INVENTION 

EVALUATION AND SELECTION OF PROGRAMMING CODE 



Box No. II APPLICANT 



[ [ This person is also inventor 



Nam e and address : (Family name followed by given name; for a legal entity, full official designation. 
The address must include postal code and name of country. The country of the address indicated in this 
Box is the applicant 's State (that is, country) of residence if no State of residence is indicated below J 

INTEL CORPORATION 
2200 Mission College Boulevard 
Santa Clara, California 95052 
United States of America 



Telephone No. 



Facsimile No. 



Teleprinter No. 



Applicant's registration No. with the Office 



State (that is, country) of nationality: 

us 



State (that is, country) of residence: 

us 



This person is applicant 
for the purposes of: 



| j all designated 



States 



T7] all designated States except 
A I the United States of America 



□ the United States 
of America only 



□ the States indicated in 
the Supplemental Box 



Box No. Ill FURTHER APPLICANT(S) AND/OR (FURTHER) INVENTOR(S) 



Name and address: (Family name followed by given name; for a legal entity, full official designation. 
71ie address must include postal code and name of country. Vie country of the address indicated in this 
Box is the applicant 's State (that is. country) of residence if no State of residence is indicated below.) 

HUANG, Bo 

Room 803, Building 31, No 1555, North of Kai Xuan Road, 
Shanghai 200063, P. R. China 



This person is: 

| | applicant only 

\yC I applicant and inventor 

I H inventor only (If this check-box is 
I I marked, do not fill in below.) 



Applicant's registration No. with the Office 



State (that is, country) of nationality: 

CN 



State (that is, country) of residence: 

CN 



This person is applicant I 1 all designated I I all designated States except F^~\ the United States 1 I the States indicated in 
for the purposes of: L I States I I the United States of America L*J of America only I 1 the Supplemental Box 



yC Further applicants and/or (further) inventors are indicated on a continuation sheet. 
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The person identified below is hereby/has been appointed to act on behalf rZ71 agent 

of the applicant(s) before the competent International Authorities as: 1* I 



□ common 
representative 



Name and address: (Family name followed by given name; for a legal entity, full official designation. 
The address must include postal code and name of country.) 

IntellecPro China Limited 
Suites 902-908, Ping'an Mansion, 
23 Jinrong Dajie, Xicheng District, 
Beijing 100032, P. R. China 



Telephone No. 

86-10-66215588 



Facsimile No. 

86-10-66210771 



Teleprinter No. 
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11015 
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SUN, Yuanhao 

N60 Apt 602, Lane 300 Pingyang Road, Shanghai 201 102, 
P. R. China 


This person is: 

| | applicant only 

\X [ applicant and inventor 

| I inventor only (If this check-box 
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Applicant's registration No. with the Office 


State (that is, country) of nationality: 

CN 


State (that is, country) of residence: 

CN 


This person is applicant I I all designated 1 1 all designated States except f"yi the United States 1 | the States indicated in 

for the purposes of: 1 1 States • | | the United States of America 1 A 1 of America only | | the Supplemental Box 
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The address must include postal code and name of country. The country of the address indicated in this 
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GAO, Lei 

Room 1-502, No. 62, Furongjiang Road, Shanghai 200336, 
P. R. China 


This person is: 

| [ applicant only 

|X | applicant and inventor 

I | inventor only (If this check-box 
I 1 is marked, do not fill in below.) 


Applicant's registration No. with the Office 


State (thai is, country) of nationality: ' 

CN 


State (that is, country) of residence: 

CN 


This person is applicant 1 1 all designated | j all designated States except r^~| the United States I | the States indicated in 

for the purposes of: I I States [ | the United States of America LflJ of America only | | the Supplemental Box 


Name and address: (Family name followed by given name; for a legal entity, full official designation. 
The address must include postal code and name of country. The country of the address indicated in this 
Box is the applicant 's State (that is, country) of residence if no State of residence is indicated below.) 

LI, Jianhui 

Room 504, No. 3, LanCun Road, Shanghai 200127, P. R. 
China 


This person is: 

| | applicant only 

| X I applicant and inventor 

1 j inventor only (If this check-box 
I I is marked, do not fill in below.) 


Applicant's registration No. with the Office 


State (that is, country) of nationality: 

CN 


State (that is, country) of residence: 

CN 


This person is applicant j | all designated I 1 all designated States except [wTl the United States I | the States indicated in 

for the purposes of: I I States | | the United States of America [£ I of America only \ | the Supplemental Box 


Name and address: (Family name followed by given name; for a legal entity, full official designation. 
The address must include postal code and name of country. The country of the address indicated in this 
Box is the applicant s State (that is. country) of residence if no State of residence is indicated below.) 


This person is: 

j ] applicant only 

j | applicant and inventor 

j | inventor only (If this check-box 
I 1 is marked, do not fill in below.) 


Applicant's registration No. with the Office 


State (that is, country) of nationality: 


State (that is, country) of residence: 


This person is applicant I I all designated J 1 all designated States except i 1 the United States J j the States indicated in 

for the purposes of: I I States " | | the United States of America | | of America only | | the Supplemental Box 


I | Further applicants and/or (further) inventors are indicated on another continuation sheet. 
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No. V DESIGN ATIOiNS 



The filing of this request constitutes under Rule 4.9(a), the designation of all Contracting States bound by the PCT on the international 
filing date, for the grant of every kind of protection available and, where applicable, for the grant of both regional and national patents. 

However, 

I | DE Germany is not designated for any kind of national protection 

| | KR Republic of Korea is not designated for any kind of national protection 

| | RU Russian Federation is not designated for any kind of national protection 

(Tlie check-boxes above may be used to exclude (irrevocably) the designations concerned in order to avoid the ceasing of the effect, under 
the national law, of an earlier national application from which priority is claimed. See the Notes to Box No. V as to the consequences 
of such national law provisions in these and certain other States.) 



Box No. VI PRIORITY CLAIM 



The priority of the following earlier application(s) is hereby claimed: 



Filing date 
of earlier application 
(day/month/year) 



Number 
of earlier application 



Where earlier application is: 



national application: 
country or Member 
of WTO 



regional application:* 
regional Office 



international application: 
receiving Office 



item(l) 



item (2) 



item (3) 



| | Further priority claims are indicated in the Supplemental Box. 



The receiving Office is requested to prepare and transmit to the International Bureau a certified copy of the earlier application(s) (only 
if the earlier application was filed with the Office which for the purposes of this international application is the receiving Office) identified 
above as: 

□ all items □ item(l) □ item (2) □ item (3) 

* Where the earlier application is an ARIPO application, indicate at least one country party to the Paris Convention for the Protection of 
Industrial Property or one Member of the World Trade Organization for which that earlier application was filed (Rule 4.10(b)(ii)): 



f ~\ other, see Supplemental Box 



Box No. VII INTERNATIONAL SEARCHING AUTHORITY 



Choice of International Searching Authority (ISA) (if two or more International Searching Authorities are competent to carry out the 
international search, indicate the Authority chosen; the two-letter code may be used): 



.C.N 



ISA/. 

Request to use results of earlier search; reference to that search (if an earlier search has been carried out by or requested from the 
International Searching Authority): 

Date (day/month/year) Number Country (or regional Office) 



Box No. VIII DECLARATIONS 



The following declarations are contained in Boxes Nos. VIII (i) to (v) (mark the applicable 
check-boxes below and indicate in the right column the number of each type of declaration): 

| | Box No. VIII (i) Declaration as to the identity of the inventor 

| | Box No. VIII (ii) Declaration as to the applicant's entitlement, as at the international filing 

date, to apply for and be granted a patent 

| | Box No. VIII (iii) Declaration as to the applicant's entitlement, as at the international filing 

date, to claim the priority of the earlier application 

Q Box No. VIII (i v) Declaration of inventorship (only for the purposes of the designation of the 

United States of America) 

| | Box No. VIII (v) ■ Declaration as to non-prejudicial disclosures or exceptions to lack of novelty 



Number of 
declarations 



Form PCT/RO/101 (second sheet) (April 2005) 



See Notes to the request form 



Sheet No. 



TNo. IX CHECK LIST; LANGUAGE OF FILING 



This international application contains: 

(a) on paper, the following number of 
sheets: 

request (including 
declaration sheets) 

description (excluding 
sequence listing and/or 
tables related thereto) 

claims 

abstract 

drawings 

Sub-total number of sheets 
sequence listing 
tables related thereto 
(for both, actual number 
of sheets if filed on paper, 
whether or not also 
. filed in electronic form; 
see (c) below) 

Total number of sheets 

(b) □ only in electronic form 
(Section 801(a)(0) 

(i) C] sequence listing 

(ii) □ tables related thereto 

(c) □ also in electronic form 

(Section 801(a)(ii)) 

(i) □ sequence listing 

(ii) □ tables related thereto 

Type and number of carriers (diskette, 
CD-ROM, CD-R or other) on which are 
contained the 

□ sequence listing: 

□ tables related thereto: 

(additional copies to be indicated under 
items 9(ii) and/or 10(H), in right column) 





1. S3 


4 


2. □ 




3. □ 


15 


4. □ 


5 




. 1 


5. □ 


5 


6. □ 


30 


7. □ 




8. □ 




9. □ 
(<) 


30 


(ii) 




(Hi) 



This international application is accompanied by the following 
item(s) (mark the applicable check-boxes below and indicate in 
right column the number of each item): 
fee calculation sheet 
original separate power of attorney 
original general power of attorney 
copy of general power of attorney; reference number, 

if any: 

statement explaining lack of signature 
priority document(s) identified in Box No. VI as 



Number 
of items 



ity ( 
item(s): 

translation of international application into 

(language): 



separate indications concerning deposited microorganism 
or other biological material 

sequence listing in electronic form 
(indicate type and number of carriers) 

□ copy submitted for the purposes of international search under 
Rule 13/eronly (and not as part of the international application) : 

□ (only where check-box (b)(i) or (c)(i) is marked in left column) 
additional copies including, where applicable, the copy for the 
purposes of international search under Rule 1 3ter 

□ together with relevant statement as to the identity of the copy or 
copies with the sequence listing mentioned in left column 

10. □ tables in electronic form related to sequence listing 

(indicate type and number of carriers) 

(i) □ copy submitted for the purposes of international search under 

Section %Q2(b-quater) only (and not as part of the international 
application) 

(ii) □ (only where check-box (b)(ii) or (c)(ii) is marked in left column) 

additional copies including, where applicable, the copy for the 
purposes of international search under Section $Q2(b-quater) 

(iii) □ together with relevant statement as to the identity of the copy or 

copies with the tables mentioned in left column : 

11. □ other (specify): 



Figure of the drawings which 
should accompany the abstract: 



Language of filing of the 

international ; 



Box No. X SIGNATURE OF APPLICANT, A GEN 

Next to each signature, indicate the name of the person signing and 




English 



RESENTATIVE 

^rsj^sjgns (if such capacity is not obvious from reading the request). 



IntellecPro China Limited 



1. Date of actual receipt of the purported h f\ f\ tZS 
| international application: ^ ^ ^ (3 Q * 1 2 0 2 0 U ^ 


2. Drawings: 
| | received: 

| | not received: 


3 . Corrected date of actual receipt Sue to later but 
timely received papers or drawings completing 
the purported international application: 


4. Date of timely receipt of the required 
corrections under PCT Article 1 1(2): 


5. International Searching Authority 

(if two or more are competent): - ISA / 


6. 1 — 1 Transmittal of search copy delayed 
I | until search fee is paid 



For International Bureau use only 



Date of receipt of the record copy 
by the International Bureau: 
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CALCULATION OF PRESCRIBED FEES 
1. TRANSMITTAL FEE 



2. SEARCH FEE 

International search to be carried out by 



CN 



the international search.) 
INTERNATIONAL FILING FEE 

Where items (b) and/or (c) of Box No. IX apply, enter Subtotal number of sheets j 
Where items (b) and (c) of Box No. IX do not apply, enter Total number of sheets ) 



fnl first 30 sheets 

QU. 



number of sheets 
in excess of 30 



fee per sheet 



\u\ additional component (only if a sequence listing ^nd/or tables 

related thereto are filed in electronic form under Sectior 80?(a)(i), 
or both in that form and on paper, under Section 801(aXn))- 



400 x 



fee per sheet 

Addamountsenteredatil,i2andi3andentertotalatI .... 

(Applicants from certain States are entitled to a reduction ff 75% of the 
international filing fee. Where the applicant is (or all applicants are) so 
entitled the total To be entered at I is 25% of the international filing fee.) 



4. FEE FOR PRIORITY DOCUMENT (if applicable) 



5. TOTAL FEES PAYABLE 

Add amounts entered at T, S, I and P, and enter total in the TOTAL box 




MODE OF PAYMENT (Not all modes of payment may be available at all receiving Offices) 

authorization to charge r~) postal money order Q cash 

^ deposit account (see below) 

□ cheque □ bank draft □ revenue stamps 



| | coupons 
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AUTHORIZATION TO CHARGE (OR CREDIT) DEPOSIT ACCOUNT 
(This mode of payment may not be available at all receiving Offices) 

(Jf] Authorization to charge the total fees indicated above. 

n (This check-box may be marked only if the condition, ^ deposit accounts 

U of the receiving Office so permit) Authorization to charge any deficiency 

or credit any overpayment in the total fees indicated above. 
Q Authorization to charge the fee for priority document. 



Receiving Office: RO/__CN_ 

Deposit Account No.: 

Date: 



Name: 



Signature: 
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EVALUATION AND SELECTION OF PROGRAMMING CODE 

An embodiment of the invention is directed to the generation and 
optimization of computer programming code. Other embodiments are also 
described. 

BACKGROUND 

When an application program is launched or run in a computer, the 
computer is executing what is referred to as a binary image (or simply, binary) 
of the program. That is not, however, the version in which the program was . 
originally created by its author. Due to the inherent design and complexity of a 
computer, programs are written using a higher level programming language- 
that is more readily understandable to a human programmer. A program is 
initially written in what is called a source programming language (resulting in 
source code or a source file). It is then translated down into the binary image 
version (also referred to as the executable or executable file) before being 
loaded into the computer's memory for execution. Software programs or tools, 
referred to collectively here as code generators, are used by the programmer to 
perform this translation. A code generator is selected that is able to translate a 
particular source file into an executable file that is to be run on a given 
computer hardware platform (e.g., one that is based on a Pentium® processor 
by Intel Corp., Santa Clara, California). 

A code generator may have the following components. A compiler 
translates one or more input source files that are written in a high level 
language (e.g., C; C++; Fortran; Pascal; Basic; as well as others) into object code 
or object files which are in a low level language referred to as machine 
language. Next, a linker joins the object files, together with library object files 
that have been previously compiled, into a binary image (the executable file). 
The binary may then be loaded into the main memory of the computer and 
executed by one or more of its processors. 
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Modern integrated circuit technologies used in advanced computer 
components are being adopted at a rapid pace. Advances are being raprdly 
m ade in computer platform architectures, such as one based on a Pentium® 
processor, and new hardware components are being designed and 
m anufactured that allow the same platform to be applied to different fields. 
These include, for example, personal computer (PC) desktops, laptops, home 
entertainment PCs, servers, home appliances, dedicated video game machmes, 
and mobile held-held devices such as cellular telephones and muMfuncnon 
personal digital assistants (PDAs). Different fields, however, present drfferen, 
requirements for the binaries that will be running on top of the hardware 
platform. For example, a program that is to run on a server is expected to have 
high performance while it is running, while programs that are for mobtle 
devrces may have more stringent code size as well as power consumpbon 
constraints. In other instances, a program is to be stored in non-volatile, sohd 
state memory of the platform, which has even more stringent limits on storage 
space due to cost concerns. Such programs are sometimes referred to as 
firmware, and may need to be compressed, prior .0 being stored. 

Current code generation tools, including compilers, linkers, and binary 
optrmizers, prov.de optimization control that can be selected by the user in an 
effort .0 generate code that has a higher performance, smaller code stze, or 
l0 „er power consumption. A binary optimizer, also sometimes referred to as a 
post-link optimizer, is a tool that is used to improve the performance of a 
pro-am after it has been comptled and linked. The tool directly operates on 
the executable file and is thus said ,0 rewrite the executable, in accordance wrth 
certain user specified optimization controls. Each of these tools may expose us 
own set of optimization controls to the user. 

■ Current code generation tools, however, do not provide a systematic 
and automated approach to meet sophisticated code generation requirements. 
For example, the current tools do not allow the user to specify simultaneously 
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both a code size optimization setting, i.e. one that is expected to reduce the size 
of the binary, and a performance optimization setting, i.e., one that is expected 
to increase the performance of the binary. 

BRIEF DESCRIPTION OF THE DRAWINGS 

The embodiments of the invention are illustrated by way of example 
and not by way of limitation in the figures of the accompanying drawings in 
which like references indicate similar elements. It should be noted that 
references to "an" embodiment of the invention in this disclosure are not 
necessarily to the same embodiment, and they mean at least one. 

Fig. 1 is a block diagram of a system for evaluating and selecting a 
binary based on its figure of merit, in accordance with an embodiment of the 
invention. 

Fig. 2 is a block diagram of an example system for generating the 
binaries. 

Fig. 3 shows another example system for generating the binaries. 

Fig. 4 is a flow diagram of a methodology for generating multiple 
binaries and then selecting or ranking them, in accordance with their figures of 
merit. 

Fig. 5 is a block diagram of a full featured system for code generation, 
evaluation, and selection, in accordance with an embodiment of the invention. 

Fig. 6 is a block diagram of a computer on which a software tool, in 
accordance with an embodiment of the invention, can run. 

DETAILED DESCRIPTION 

Fig. 1 is a block diagram of a system for evaluating and selecting a 
binary based on its figure of merit, in accordance with an embodiment of the 

3 
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invention. A first evaluator 104 measures a first characteristic of several input 
binaries 106. The evaluator 104 computes a number of first figures of merit 
(FOMs) 108 for the input binaries 106. In other words, FOM1 is the computed 
figure of merit for binary 1, FOM2 is the computed figure of merit for binary 2, 
etc. In this example, there are four binaries 106 illustrated, however, there may 
be as few as two or more than four, depending on how fine-grained the 
available optimization controls are. Different techniques for generating the 
binaries 106 will be described below. 

Each of these input binaries 106 is generated with a different, code 
generator optimization setting, for the same processor instruction set 
architecture. The input binaries 106 may also be based on the same set of one 
or more source files (not shown). The binaries 106 may all be generated using 
the same code generator tool set, configured according to different 
optimization setting. The tool set may include components from different 
software vendors {e.g., a compiler and linker from one vendor, and a binary 
rewriter from another). Note that the input binaries 106 may be generated 
either manually by the user one at a time, or as described below automatically 
according to a script. 

The evaluator 104 in this example is to measure the performance of each 
input binary. This may be done by having the binary be executed by a 
hardware platform that implements the processor instruction set architecture 
for which the binary has been generated. Alternatively, the evaluator 104 may 
include a software simulation tool, which simulates the hardware platform, 
including the processor and I/O device resources that are present in the actual 
hardware platform. The binary is thus executed on its intended hardware 
platform, either actually or though simulation, and its performance is 
measured. Performance may be measured by feeding the running binary a 
predefined set of inputs and measuring how fast the expected outputs are 
produced. The measured performance is then translated into the FOM 108. 



4 
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Por examp.e, faster execution of a parncu.ar task may fians.ate to a .owe, FOM, 
wh „e s.ower execufion of the same task trans,a,es to a higher FOM. The best 
binary in .hat case would be the one with the .owest FOM 108. 

The computed FOMs 108 are fed to a binary selector 110, whrch 

FOM 112. Although various ways of denning the FOMs and overaU FOMs 
possible, an easy to implement approach is to define each FOM as bang a 
plve integer As a srmp.e examp.e, the overal, FOM 112 may be the same as 
its corresponding FOM 108, that is, 

(Equation 1) 

overall FOM1 = FOM1, 
overall FOM2 = FOM2, etc. 

In that case, binary selector 110 performs a straight ranking of the 
overall FOMs 112 and. in this example, determines that overall FOM3 
corresponding to binary 3, has the highest or iowest va.ue. The binary selector 
HO thus indicates to the user that binary 3 is the "best" of the four mput 
binaries 106. from the standpoint of performance (being the measured 
cbaracterisfic). The combination of the evaluator 104 and bmary selector HO 

image in a systematic manner. This genera! framework allows sophrshcated 
cod e generation requirements to be evafuated. using multiple evadors as 
described below. 

H.. 1 shows an example of a system with multiple evaluators. In 
addifionlo contaimng evaiuator 104 described above, the system has another 
equator, evaluator 116 which measures another characteristic of the utpu, 
binaries 106, namely their compressed file sizes. Each input bmary 106 » 
compressed, and the size of the resulfing compressed file is translated by the 
evaluator 116 to a second FOM 120. This may be performed usmg a 
convenfiona,. software compression too. that is surtab.e for compressmg da a 
in the form of a binary (machine executab.e data). This type of eva.ua,or 

c 



useful when the code generation requirements (of a particular field of use) call 
for a binary that has a maximum compressed file size. For example, the binary 
may include a firmware driver program that is to be stored in non-volatile, 
solid state memory of a computer (e.g., a basic I/O system, BIOS, a boot routine, 
or a network management program). 

The FOMs 120 are also fed to the binary selector 110 which compares the 
FOMs 120, while aiming at selecting the binary with the lowest or highest 
overall FOM. The "comparison" involving the FOMs 108 and FOMs 120 is 
broadly defined here, and may be implemented in several ways. As one 
example, an overall FOM is computed for each input binary 106, as a function 
of the FOM 108 and FOM 120. This may be a simple equation such as 

overall FOM1 = FOMlperf. + FOMl C om P r.size (Equation 
2) 

overall FOM2 = FOM2 per f. + FOM2com P r.si2e 

In yet another alternative, the comparison amongst the different FOMs 
may use the concept of a vector for each binary. For example, 



overall FOM1 = square_root(FOMl 2 per f. + FOMl 2 C om P r.size) (Equation 3) 
overall FOM2 = square_root (FOM2 2 per f. + FOM2 2 C om P r.siz e ) 

If the performance FOM is defined as above, namely, the greater the 
performance of a binary, the smaller its associated FOM 108, then the 
compressed size FOM should be defined so that the smaller the compressed file 
size of a binary, the smaller its associated FOM 120. This approach thus defines 
the "best" overall FOM as the one having the lowest value. 

Note that in the comparisons described above involving two measured 
characteristics, the equations for overall FOM weight the performance and 
compressed size FOMs equally. As an alternative, the equation may specify 
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lcn CTivp , he US er the option to manually dehne the 
nossible. The system may also give the user me r 

P « „ ,11 FOM) e ? in view of a particular held of 

overall FOM (a "configurable" overall FOM), e.g. m 

use. 

This methodology may be extended ,0 more than two evafuators. For 
example, in Fig- a there is a third — U8 that measures the power 
consumption of each binary 106, and computes a respecttve FOM 122 as 
action or that measurement. The evaluator U8 can measure the power that 
consumed oy — each binary with the same tas, and on e sa» 
hardware plat f orm (either in actuality or via a —on, The FOMs 12 
computed by the eva.uatcr 118 are a!so fed to the binary setector 110. 
La y se, el HO compares the FOMs 122, either amongst themseWes or to 
L POMS 108 and/or POMs 120, ,0 compute an overall FOM 112 or each 
La, ( ,,, as a function of all fh.ee measured — cs of 
Consistent with the definition of the overall POM given above, the third FOM 
rmaybedefinedsuchthattKehigherd.e.neasuredpowercons^phon.the 

gr eater the FOM !22, and the lower the measured power- consunrpb on, th 
lower the FOM 122. This definition is consistent with the concept cost 
the FOM (described below). 

In addition to performance, compressed file size, and power 
consumpfion, the measured characteristics may also include code size *. « 

code and/or data portion of the binary once it is loaded and runnmg on 
tended hardware platform). Any two or more of such measured 



cKa.actens.ics may be evalua.ed, by including two or more corresponding 
eva.ua.ors wittun the system. A system may be delivered tha, is custom 
designed wtth only two evadors, whereas a My featured system may have 
iLrmore evadors integrated in the same software too.. As yet another 
atternanve, the system may be delivered with only a smgle equator. A 
system with multiple evaluators can be advantageously used for generatmg 
ophmized binary images in more than one field of use. 

Turnmg now to Fig. 2, Fig- 2 is a Ho* dtagram of an example system 
for generating the binaries. The code generator in mis instance includes a 
compiler 204, and a linker 206 that processes an output of the corner to 
produce the input binaries 106 based on the same source program « 
Lee program 202 may be in the form of one or more source files. The fit** 
2 06 links two or more object files and libraries into a resuming binary unage. 
The compiler 204, as well as in some instances, the linker 206, exposes 
re ,a„ve,y fine-grained optimization controls to the user. As an example ^ such 
controls include the swrtching on or off of loop-uruolhng ophmtzauon, turn ng 
on or off veCorizatron, and turning on or off constan. propagation. Current 
confers and ankers have a relatively limited number of additional 
optimization controls tha, are exposed to the user. 1, is expected that addmg 
JL optimizafion controls to such tools will further improve the resulhng 
binary image tha. is selected by .he processes described here. 

The system in Fig. 2 also includes a scrip, processor 208 that processes 
an inpu, scrip, 220, The scrip. 2X0 may have been authored by the user, or * 
ma y have been generated automatically by another program. The scnpt 
processor 208 reads two or more opumizafion combinauons or optinuzarton 
settings from .he inpu. script 2X0, and configures the comp.ler 204 (and 
perhaps the linker 206), in accordance with a given setting. Once configured m 
Ls way, the compiler and linker are .hen instructed (e.g., by the scnp, 
processor 208, to produce an inpu. binary !06, based on .he source program 



202 as input. The script processor 208 may then step through each subsequent 
optimization setting that is specified in the script 210, instructing the compiler 
204 and linker 206 to produce further binaries 106 one after the other (again 
based on the same source program 202). The system thus automatically 
executes the wishes specified in the script 210, and generates a number of 
binaries 106 that are then evaluated by the system depicted in Fig. 1 described 
above. 

Each optimization setting is different than another, and may be defined 
based on the user's knowledge of what each optimization setting is expected to 
accomplish in a general sense (in terms of the associated binary being more 
suitable for a given field of use). With the help of the evaluator and binary 
selector of Fig. 1, the user can in effect request optimizations that traditionally 
would not be allowed to be performed simultaneously by a conventional tool. 
For instance, there may be a combination optimization setting that specifies a 
compiler control that is expected to generate faster but more voluminous code, 
combined with a control that is expected to produce smaller code. Using the 
systematic approach of the evaluator and binary selector, the net effect of 
several of such optimization settings are evaluated and compared (through the 
FOM mechanism described above) to find the "best" one. This systematic 
process helps remove some of the guess work that may otherwise be 
unavoidable while trying to find the best optimization setting for a particular 
field of use. 

Turning now of Fig. 3, another embodiment of the invention is depicted 
where the binaries 106 are generated by a binary rewriter 304 based on the 
same, "initial" binary 302. The binary rewriter 304 may be a conventional, 
binary rewriting tool, e.g., a static binary translator with both its input binary 
and its output binary targeting the same instruction set architecture. The 
binary rewriter 304 exposes optimization controls to the user that, in this 
embodiment, are received from a script processor 308 that has read the user's 



\ 



generator exposes more fine-grained op to be mad e, 

* different optimization combinations 10 



by the evaluators. 



of thrs were g ^ ^ ^ ^ optlIMZaBons are 

binary rewnrei (Fig. as wi thout recompiling 

is its compressed file size, i rr H pfined in view of the 

and its associated POM may vary and canbe easily dehned, 
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range of the characteristic being measured (e.g., the longest and shortest run 
times expected for any given input binary) and a defined range for the FOM. 

In operation 408, the current overall FOM is compared with a prior 
overall FOM. The latter is an overall FOM that may have been previously 
computed and that is associated with a prior version of the binary. For the 
initial pass, there may be no prior computed FOM, such that operation 408 may 
be skipped. 

After the comparison in operation 408, if there are further optimization 
settings to be evaluated (operation 412) then the process cycles, with another 
optimization setting. This time, operation 408 is performed since there is a 
prior overall FOM available now. Note that in subsequent cycles, operation 
408 may involve multiple comparisons between the current overall FOM and 
each of several prior overall FOMs that are stored. 

Once the last optimization setting has been evaluated, the loop is exited 
at operation 412, and the process proceeds with either operation 43 4 and /or 
operation 416. In the former/the system indicates to the user which version of 
the input binaries has the highest or lowest ("best") overall FOM, as 
determined from the comparisons that were made in the iterative process. 
Note that the system may be designed such that only the binary that has the 
highest or lowest overall FOM value at any given point in the iterative process 
is saved (thereby helping conserve memory resources). 

In addition to, or as an alternative to, operation 414, there is operation 
416 in which the system can display to the user a ranking of the different 
binary versions, in accordance with their overall FOMs, e.g. from highest to 
lowest. This embodiment of the invention allows the user to quickly determine 
how "far apart" the different versions of the binaries are from each other in 
view of the respective optimization settings used to generate them. Other ways 
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of displaying the results of the comparison performed by the binary selector 
are possible. 

The flow diagram of Fig. 4 also shows operation 406 which is performed 
in situations where there is at least one further characteristic that is to be 
measured for each input binary. In that case, the FOM comparisons of 
operation 408 may be carried out as vector operations. For example, an 
overall FOM can be computed as a function of all of the FOMs for the current 
version, as in Equation 3 above. This value is then compared with a prior, 
overall FOM. The latter is computed as a function of all of the FOMs of a prior 
version. 

Turning now to Fig. 5, a block diagram of a full featured system for code 
generation, evaluation, and selection, in accordance with an embodiment of the 
invention is shown. In contrast to the embodiments described above in Figs. 2 
and 3, the code generator in this case includes all three code generation 
components, namely, compiler 204, linker 206, and binary rewriter 304. Each 
optimization setting in this instance may include controls for any one or all of 
the compiler, linker, and binary rewriter. Each new binary image at the output 
of binary rewriter 304 is fed to each one of a number of cost evaluators 508. 
The cost evaluator 508 computes a cost, as a function of a measured 
characteristic of the binary. Here, "cost" is not limited to a monetary amount 
that is to be paid or charged. Rather, it is used more generally to refer to any 
outlay or expenditure (as an effort or sacrifice) made to achieve an object. 
Alternatively, cost represents a loss or penalty that is incurred by the measured 
characteristic of the binary. Under this approach for the FOM, the binary that 
is associated with the lowest cost, or the lowest overall cost in the case where 
multiple factors are to be taken into consideration, becomes the best binary. 
This is determined by the binary selector 510 which compares the computed 
costs and selects the binary having the lowest overall cost. An example pseudo 



12 



*u>MMx,jm® i y, u, 3 }$ 4 

if 

code that describes the framework of cost-driven code generation through an 
iterative exploration process (based on the concept of Fig. 5) is given below. 

for (each optimization combination control of the compiler) 

Compile the source codes with the current compilation optimization control; 
for (each optimization combination control of the linker) 

Link the program PI using the current linking optimization control; 
for (each optimization combination control of the binary rewriter) 

' Rewrite the program PI using the current optimization control 

for the binary rewriting, generate new binary image P2; 
Evaluate the cost(s) of P2; 

Perform binary selection between P2 and the old binary image 
kept in the Binary Selector; 

} 

} 

//The final binary image kept in the binary selector is the best code generated by the 
system 

The above described pseudo code thus provides the most optimized 
(lowest cost) code, by rewriting the binary multiple times (each time using a 
different optimization setting) in the inner loop, and then recompiling the 
source program and relinking the recompiled object files (outer loop). 

Integrating the compiler, linker, and the binary rewriter in the manner 
described above brings additional capabilities for code optimization. Also, the 
system flexibility of bundling together several evaluators gives the general 
framework the ability to take additional factors into consideration when 
selecting the best binary image. The system also provides a framework to 
better study the correlation between a particular optimization control and the 
cost implications that are brought as a result into the binary image. 

Turning now to Fig. 6, a block diagram of a computer is shown, on 
which a software tool, in accordance with an embodiment of the invention, can 
run, to perform the processes described above. The computer has a processor 
604 that is an example of a machine that can execute instructions stored in main 
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memory 606, toperform the operations described above. The main memory 
606 is a machine-readable medium that has stored therein an operating system 
607, and on top of which will run a code generator program 609, evaluators 104, 
116, and binary selector 110. The code generator program 609 may include one 
or more of the code generator components described above such as a compiler, 
linker, and binary rewriter. These programs may also be provided in other 
types of machine-readable media. The results of executing programs in main 
memory are displayed to the user, for example, displaying ranking of a number 
of input binaries by the binary selector 110, on a user display 616. The 
processor 604 accesses the user display 616 through an I/O interface 610, and a 
display controller 614. As to the I/O interface 610, this allows the processor 
and memory to communicate with additional devices, including, for instance, a 
keyboard or pointer 618, a mass storage 620 (e.g., a hard disk drive), and a 
network interface 624 over which the computer can be used to access other 
nodes of a network. Other arrangements of a computer for running the 
different software tools described above are possible. 

A machine-readable medium may include any mechanism for storing or 
. transmitting information (such as any one or more of the software components 
described above) in a form readable by a machine (e.g., a computer), not limited 
to Compact Disc Read-Only Memory (CD-ROMs), Read-Only Memory (ROMs), 
Random Access Memory (RAM), Erasable Programmable Read-Only Memory 
(EPROM), and a transmission over the Internet. 

The invention is not limited to the specific embodiments described 
above. For example, although shown to be in parallel the measurements of 
operations 404 and 406 may occur sequentially. In general, the order of the 
operations, as they are illustrated in Fig. 4 for example, may be changed in 
practice, depending on any given implementation of the overall process. For 
instance, the different versions of the binary may be produced in parallel by the 



14 



37 



same machine, using a multi-threading process. Accordingly, other 
embodiments are within the scope of the claims. 
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CLAIMS 

What is claimed is: 

1. A system for evaluating and selecting programming code, comprising: 
a first evaluator to measure a first characteristic of a plurality of input 

binaries and compute a plurality of first figures of merit for the plurality of . 
input binaries, respectively; and 

a binary selector to compare the plurality of first figure of merit and 
select one of the plurality of input binaries as having the highest or lowest 
overall figure of merit. 

2. The system of claim 1 further comprising: 

a second evaluator to measure a second characteristic of the plurality of 
input binaries and compute a plurality of second figures of merit for the 
plurality of input binaries, respectively, wherein 

the binary selector is to compare the plurality of second figures of merit. 

3 . The system of claim 2 wherein the binary selector is to compute an 
overall figure of merit for each of the input binaries as a function of the input 
binary's first and second figures of merit. 

4. The system of claim 2 further comprising: 

a third evaluator to measure a third characteristic of the plurality of 
input binaries and compute a plurality of third figures of merit for the plurality 
of input binaries, respectively, wherein 

the binary selector is to compare the plurality of third figures of merit. 

5. The system of claim 2 wherein the first characteristic is performance, the 
greater the performance the smaller its associated figure of merit, and the 
second characteristic is compressed file size, the smaller the compressed file 
size the smaller its associated figure of merit. 
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6. The system of claim 2 wherein the binary selector is to compare the 
plurality of first and second figures of merit by computing a mathematical 
operation for each of the input binaries which includes the respective first and 
second figures of merit of that input binary. 

7. The system of claim 6 wherein the first and second measured 
characteristics are selected from the group consisting of: performance, code 
size, power consumption, compressed file size, and memory footprint. 

8. The system of claim 7 further comprising: 

a code generator that includes a compiler and a linker to process an 
output of the compiler and produce the input binaries, wherein 

the compiler exposes an optimization control to its user selected from 
the group consisting of: loop-unrolling; vectorization; and constant 
propagation. 

9. The system of claim 8 wherein the code generator further comprises a 
binary rewriter to process an output of the linker, and produce the input 
binaries, wherein the binary rewriter exposes an optimization control to its user 
selected from the group consisting of: constant propagation; code shrinking; 
and specialization. 

10. The system of claim 8 further comprising: 

a script processor to process an input script from the user, the script 
processor to read a plurality of optimization combinations from the input script 
and configure the code generator in accordance with the optimization 

combinations, wherein 

the code generator is to produce the input binaries as configured by the 

optimization combinations, respectively. 

11. The system of claim 7 further comprising: 

a binary rewriter to produce the input binaries based on a source binary, 

wherein • 
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the binary rewriter is to expose optimization controls to its user. 

12. The system of claim 11 further comprising; 

a script processor to process an input script from the user, the script 
processor to read a plurality of optimization combinations from the input script 
and configure the binary rewriter in accordance with the optimization 

combinations, wherein 

the binary rewriter is to produce the input binaries as configured by the 

optimization combinations, respectively. 

13. A machine-implemented method for processing computer programming 
code, comprising: 

a) producing a current version of a binary using a current optimization 
setting; 

b) measuring a characteristic of the current version and computing a 
current figure of merit (FOM) associated with the current version; 

c) comparing the current FOM with a previously computed FOM 
associated with a prior version of the binary; and 

automatically repeating a)-c) for another optimization setting. 



14. The method of claim 13 further comprising: 

indicating to a user the version of the binary that has the highest or 
lowest FOM as determined from the comparisons. 

15. The method of claim 14 further comprising: 

ranking a plurality of versions of the binary in accordance with their 
respective FOMs as determined from the comparisons. 

16. The method of claim 13 wherein the current and another optimization 
settings include optimization controls for compilation, Unking, and binary 
rewriting, and wherein said producing comprises: 
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compiling source code and linking object files to produce an initial 
version of the binary, and rewriting the initial version into the current version, 
using the current optimization setting. 

17. The method of claim 13 wherein the current and another optimization 
settings include optimization controls for compilation and linking, and wherein 
said producing comprises: 

compiling source code and linking object files to produce the current 
version of the binary, using the current optimization setting. 

18. The method of claim 13 wherein the current and another optimization 
settings include optimization controls for binary rewriting, and wherein said 
producing comprises: 

rewriting an initial version of the binary into the current version, using 

the current optimization setting. 

19. The method of claim 13 further comprising: 

d) measuring another characteristic of the current version and 
computing another figure of merit (FOM) associated with said another 
characteristic and the current version; and 

e) comparing said another FOM with a previously computed FOM that 
is associated with said another characteristic and with a prior version of the 
binary. 

20. The method of claim 13 wherein the binary comprises a firmware driver, 
and the characteristic is compressed file size of the binary. 

21. An article of manufacture comprising: 

a machine-accessible medium containing instructions that, when 
executed, cause a machine to: 

a) generate a binary under an optimization setting; 

b) compute a cost as a function of a measured characteristic of the 
binary; 
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c) perform a) - b) a plurality of times each time with a different 
optimization setting but based on the same source program; and 

d) compare the computed costs, to select the binary having the lowest 
overall cost. 

22. The article of manufacture of claim 21 wherein the instructions cause the 
machine to perform a) - b) a plurality of times, by first compiling the source 
program and then rewriting the binary a plurality of times and then 
recompiling the source program and then rewriting the recompiled binary a 
plurality of times. 

23. The article of manufacture of claim 21 further comprising instructions 
that cause the machine to perform b) by computing a further cost as a function 
of a measured, further characteristic of the binary generated in a). 

24. The article of manufacture of claim 23 wherein the instructions cause the 
machine to compare the computed costs in d), by computing an overall cost for 
each generated binary, wherein the overall cost is a function of said cost and 
said further cost computed in b). 
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A system for evaluating and selecting programming code is described. 
A first evaluator measures a first characteristic of a number of input binaries. 
The evaluator computes a number of first figures of merit for the input binaries, 
respectively. A binary selector compares the first figures of merit, and selects 
one of the input binaries as having the highest or lowest overall figure of merit. 
Other embodiments are also described and claimed. 



21 



1 /5 




o 

£ o o 

s <3 





o 

I_L 













ooj 








CO 






i_ 






o 


we 






o 


Q_ 


CO 




E 


> 


=3 


UJ 




CO 




Con 



Ki/mm / 00 2 38 4 



2/5 




i 


i 


i— 




CD 






s\ 


c 


cnjI 


_J 




i 




i_ 




0) 




'a. 
E 


cnjI 


o 




O 





CD CD CD 

c c c 



0) 




a> 


*cL 




"5. 


E 


E 


E 


o 


o 


o 


o o o 




CNJ 


CO 


CD 


CD 


CO 


c 


C 


c 




» 




"5 


CD 


"55 


GO 


CO 


CO 



E 




CD 




i_ 




CD 




O 




\— 




CL 


s 


CD 


OJ 


a 








O 




CO 









Binary 2 


Binary 1 






CD 
O 



i 











o 






CO 






to 


SI 




CD 




u 


O 


col 


CO 


2 









I ; 


SI 








CO 


CD 


c 




CO 






CD 




cr 


i 


i 





CO 



is 



Pick Another 
Optimization 
Setting 



Produce A Current Version Of 
A Binary Using Current 
Optimization Setting 402 



Measure A Characteristic Of 

The Current Version And 
Compute A Current FOM 404 



Compare The Current Overall FOM 
With A Prior Overall FOM 408 



Measure Another Characteristic 
Of The Current Version, And 
Compute Another FOM 406 



Yes 



Any More 
Optimization Settings? 
412 



\ 


r 


+ 


Indicate To A User The Version 
Of The Binary That Has The 
Highest Or Lowest ("Best' 1 ) 
Overall FOM 414 




Rank The Versions Of The 
Binary According To 
Overall FOM 416 





FIG. 4 







i— 




o 


17$ 


*co 


o 


ZD 


O 


CO 


> ■ 



III 

z — < 



00 

o 

LO 




LO 

6 



CD 

o to 

3 =2 

9 LL- 



:izmzm / q o ?. 3 & & 



5/5 








o 




E 






81 






cz 




'to 
















o 








CO 




<D 


s l 


O 


col 


O 






Q_ 






CD 

o 



